Speeding Up Bayesian HMM by the Four Russians Method
نویسندگان
چکیده
Bayesian computations with Hidden Markov Models (HMMs) are often avoided in practice. Instead, due to reduced running time, point estimates – maximum likelihood (ML) or maximum a posterior (MAP) – are obtained and observation sequences are segmented based on the Viterbi path, even though the lack of accuracy and dependency on starting points of the local optimization are well known. We propose a method to speed-up Bayesian computations which addresses this problem for regular and time-dependent HMMs with discrete observations. In particular, we show that by exploiting sequence repetitions, using the four Russians method, and the conditional dependency structure, it is possible to achieve a Θ(log T ) speed-up, where T is the length of the observation sequence. Our experimental results on identification of segments of homogeneous nucleic acid composition, known as the DNA segmentation problem, show that the speed-up is also observed in practice. Availability: An implementation of our method will be available as part of the open source GHMM library from http://ghmm.org.
منابع مشابه
Speeding up the Stress Analysis of Hollow Circular FGM Cylinders by Parallel Finite Element Method
In this article, a parallel computer program is implemented, based on Finite Element Method, to speed up the analysis of hollow circular cylinders, made from Functionally Graded Materials (FGMs). FGMs are inhomogeneous materials, which their composition gradually varies over volume. In parallel processing, an algorithm is first divided to independent tasks, which may use individual or shared da...
متن کاملMCMC for non-linear state space models using ensembles of latent sequences
Non-linear state space models are a widely-used class of models for biological, economic, and physical processes. Fitting these models to observed data is a difficult inference problem that has no straightforward solution. We take a Bayesian approach to the inference of unknown parameters of a non-linear state model; this, in turn, requires the availability of efficient Markov Chain Monte Carlo...
متن کاملSpeeding up the Four Russians Algorithm by About One More Logarithmic Factor
We present a new combinatorial algorithm for Boolean matrix multiplication that runs in O(n(log log n)/ log n) time. This improves the previous combinatorial algorithm by Bansal and Williams [FOCS’09] that runs in O(n(log log n)/ log n) time. Whereas Bansal and Williams’ algorithm uses regularity lemmas for graphs, the new algorithm is simple and uses entirely elementary techniques: table looku...
متن کاملAchieving a log(n) Speed Up for Boolean Matrix Operations and Calculating the Complexity of the Dense Linear Algebra step of Algebraic Stream Cipher Attacks and of Integer Factorization Methods
The purpose of this paper is to calculate the running time of dense boolean matrix operations, as used in stream cipher cryptanalysis and integer factorization. Several variations of Gaussian Elimination, Strassen’s Algorithm and the Method of Four Russians are analyzed. In particular, we demonstrate that Strassen’s Algorithm is actually slower than the Four Russians algorithm for matrices of t...
متن کاملBitslicing and the Method of Four Russians Over Larger Finite Fields
We present a method of computing with matrices over very small finite fields of size larger than 2. Specifically, we show how the Method of Four Russians can be efficiently adapted to these larger fields, and introduce a row-wise matrix compression scheme that both reduces memory requirements and allows one to vectorize element operations. We also present timings which confirm the efficiency of...
متن کامل